Outliers and data descriptions
Authors
Abstract
In previous research, the support vector data description (SVDD) was proposed to solve the problem of one-class classification. In one-class classification, one set of data, called the target set, has to be distinguished from the rest of the feature space. The original SVDD optimization requires the user to supply two parameters beforehand. In this paper, a new heuristic error is defined. By minimizing this error, both free parameters of the SVDD can be determined without using example outlier objects. The paper shows under what circumstances the heuristic error correlates well with the true error.
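To make the one-class setting concrete, the following is a minimal sketch of a data description fitted on target objects only. It is not the SVDD optimization and not the heuristic error from the paper: it swaps in a much simpler stand-in, a sphere centred on the mean of the targets whose radius is chosen so that a user-set fraction of targets falls outside. The names `fit_sphere`, `is_target`, and `reject_fraction` are hypothetical and chosen only for illustration, and the Gaussian sample data are synthetic.

```python
import math
import random


def fit_sphere(targets, reject_fraction=0.1):
    """Fit a naive spherical description (assumption: not the SVDD itself).

    The centre is the mean of the target objects; the radius is the distance
    of the farthest target that is still kept after rejecting the given
    fraction of the most distant targets.
    """
    n = len(targets)
    dim = len(targets[0])
    center = [sum(p[i] for p in targets) / n for i in range(dim)]
    dists = sorted(math.dist(p, center) for p in targets)
    n_reject = math.floor(reject_fraction * n)
    keep = max(1, n - n_reject)
    radius = dists[keep - 1]
    return center, radius


def is_target(point, center, radius):
    """Accept a point as a target object if it lies inside the sphere."""
    return math.dist(point, center) <= radius


# Synthetic target set: 200 two-dimensional Gaussian samples.
random.seed(0)
targets = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(200)]

center, radius = fit_sphere(targets, reject_fraction=0.1)
accepted = sum(is_target(p, center, radius) for p in targets)
far_point_accepted = is_target((10.0, 10.0), center, radius)

print(f"radius={radius:.3f}, accepted {accepted} of {len(targets)} targets")
print(f"far point accepted: {far_point_accepted}")
```

Here `reject_fraction` plays the role of a parameter that the user has to set beforehand, and no outlier examples are used during fitting. The point `(10.0, 10.0)` is used only to check the fitted description afterwards.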
Similar resources
Identification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data are often modeled using the vector autoregressive moving average (VARMA) model. However, the presence of outliers can violate the stationarity assumption and may lead to incorrect modeling, biased parameter estimates, and inaccurate predictions. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
Methods for determining outlier data in medical studies
Background: An outlier is an observation that lies an abnormal distance from other values in a random sample from a population. Outliers sometimes lead to abnormalities in the results obtained from collected data and information. Recognizing outlier data is important for researchers, physicians, and others who work in medical fields and sciences, and they must check data before getting result a...
Limiting the Search Range of Correlation Stereo Using Silhouettes
We present a new approach to combine two approaches to three-dimensional reconstruction: silhouette-based and correspondence-based approaches. The two approaches have complementary costs and benefits. Silhouette-based approaches deliver volumetric descriptions which often have very few outliers, but they cannot reconstruct concave surfaces. Correspondence-based approaches give surface descripti...
Impact of Outliers in Data Envelopment Analysis
This paper examines the relationship between data envelopment analysis and the statistical concept of the outlier. Data envelopment analysis (DEA) is a method for estimating the relative efficiency of decision-making units (DMUs) that perform similar tasks in a production system, using multiple inputs to produce multiple outputs. An important issue in statistics is identifying outliers. In this pap...
Investigation of outliers of evaluation scores among school of health instructors using outlier-determination indices
Introduction: Teacher evaluation, as an important strategy for improving the quality of education, has been considered by universities and leads to a better understanding of the strengths and weaknesses of education. Analysis of instructors’ scores is one of the main fields of educational research. Since outliers affect analysis and interpretation of information processes both structurally and concep...
Who Should be Interviewed? A Response from Cluster Analysis
Objective: This article presents an application of cluster analysis for social science research, especially studies that include interviews as part of their data collection. This application is particularly suitable for sequential mixed-methods researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews. Methods: In more detail, the algorithm (i....